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Abstract 

We formulate the planar 'large N limit' of matrix models with a con- 
tinuously infinite number of matrices directly in terms of U (N) invariant 
variables. Non-commutative probability theory, is found to be a good lan- 
guage to describe this formulation. The change of variables from matrix 
elements to invariants induces an extra term in the hamiltonian,which 
is crucual in determining the ground state. We find that this collective 
potential has a natural meaning in terms of non-commutative probability 
theorydt is the 'free Fisher information' discovered by Voiculescu. This 
formulation allows us to find a variational principle for the classical theory 
described by such large N limits. We then use the variational principle 
to study models more complex than the one describing the quantum me- 
chanics of a single hermitian matrix (i.e., go beyond the so called D — 1 
barrier). We carry out approximate variational calculations for a few 
models and find excellent agreement with known results where such com- 
parisons are possible. We also discover a lower bound for the ground state 
by using the non-commutative analogue of the Cramer-Rao inequality. 
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1 Introduction 



The purpose of the present paper is to find the natural mathematical language 
for describing the 'large N limit' of matrix models involving a continuously 
infinite number of matrices (in the Hamiltonian picture) e.g matrix quantum 
mechanics and matrix field theories. We find that non-commutative probabil- 
ity theory offers such a language, which after suitable generalizations can be 
adopted for the purposes of formulating the large N limit. Recasting continuum 
matrix models in this language allows us to find a variational principle for cal- 
culating the gauge invariant observables of such models. The advantage of the 
variational method being that we can use it to make well controlled approxima- 
tions when exact solutions are no longer possible. 

No apologies are really necessary for the study of continuum matrix models 
as they are ubiquitous in the world of both gauge theories and strings. QCD, 
where the gluonic field belongs to the adjoint representation of the struc- 
ture group, is perhaps the most natural example of a field theory where the 
dynamical degrees of freedom are matrix valued. It is believed that the planar 
large N limit is the correct simplification to consider while trying to under- 
stand non-perturbative phenomena such as confinement, understanding which 
in the physically interesting case of four spacetime dimensions remains one of 
the major unsolved problems of theoretical physics. QCD when dimensionally 
reduced to one dimension is known to provide a regularization of the theory of 
the classical membrane, and the generalization of this method to the case of 
the supermembrane provides one with the Hamiltonian which is conjectured to 
be a description of M-theory in the infinite momentum frame ^] . Other exam- 
ples of one dimensional matrix models include those with a single matrix (with 
polynomial self interactions) depending continuously on a parameter (which is 
usually taken to be physical or stochastic time). Such models are known to 
provide explicit realizations of string field theories in one dimensional target 
spaces g [|. 

In a previous paper[^| we studied 'static' matrix models, involving the inte- 
gral over a finite number of hermitean matrices: 




(1) 
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It is known that the theory of 'free random variables' jljhas a lot to do with 
the large N limit of these models |], ^, fj], |j. The mathematician's notion of 
'freeness' |j| |l0|, 11, |l2j and the physicist's picture of the planar large N limit 
are intimately related. The large N limit of matrix models was formulated in 
terms of a variational principle. There is an anamolous term in this variational 
principle, which wc showed, has a simple meaning in terms of non-commutative 
probability theory: it is the free entropy of Voiculescu. 

In the present paper we will extend our study to 'dynamical' matrix models, 
determined by hamiltonians such as 

In the path integral formulation of quantum mechanics this leads to integrals 
such as 

- J[K is tr Aa ]+ V(A)}dt tr ^ {h) , . . AMVA _ (3) 



Starting from the quantum theory at finite 'N', there has been considerable 
progress in extracting the large N limit, which is described by a classical Hamil- 
tonian. The 'Collective Field' formalism of Jevicki and Sakita (b| [jj, [l|, 
and the 'coherent state' picture of Yaffe jl8| stand out in this area. The idea is 
to change variables from the matrix elements to the invariants of the matrices, 
because quantum fluctuations in these invariants become small in the large N 
limit. Thus the theory when expressed in terms of invariants should be a clas- 
sical theory. One of the key lessons of their work is that this large N classical 
limit differs in many deep ways from the more familiar classical limit as h — > 0. 
Most striking is the appearance of certain universal terms in the gauge invariant 
formulation of the Large N theory (e.g. The (f> 3 term in the collective potential 
of Jevicki and Sakita). 

To understand the origin of these terms, let us consider a toy model from el- 
ementary quantum mechanics: a rotation invariant system whose configuration 
space is R N and has hamiltonian operator 

k = - n2 d^Q-l + V ^i r = V[x i x i }. (4) 

If we consider only wavefunctions that are rotation invariant ( zero angular 
momentum), the angular variables can be eliminated and the hamiltonian can 
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be expressed as a differential operator in the invariant variable r. The simplest 
way to perform this change of variables is to note that 



h z d i iP*d i i> + V(r)\iP\ z \d™x = C N J \h'W\ z + V(r)WJ r"- l dr (5) 

where CVr^ -1 is the area of the sphere of radius r. This factor is the jacobian 
of the nondinear change of variables x l — > r. Even if we make the change of 

N — 1 

variables to the 'radial wavefunction' \P(r) = r^~ip(r) to absorb this jacobian 
factor, there will be some derivatives of the jacobian that survive. After an 
integration by parts, we get 

o(N-l)(N-3) 



C N J ft 2 |*'| 2 + {? 



4r 2 



V{r) |* 



2 



dr (6) 



Thus, the radial hamiltonian is not just —K + V(r), but is instead, 

(7) 

The extra term is a sort of 'centrifugal potential' arising from differentiation 
of the volume of the sphere of radius r. It is at first surprising that such 
a centrifugal potential is present even when the angular momentum is zero: a 
phenomenon with no analogue in the classical theory. Indeed, this term vanishes 
as Ti — > 0. This term is universal in the sense that it is independent of the choice 
of potential: it arises from rewriting the kinetic energy in radial variables. 

To complete the story of this toy model, we see how to take the large N 
limit. In this limit we should expect that 

P 2 = j-jXiXi (8) 

should have small fluctuations. (This is a kind of central limit theorem.) Thus 
expressed in these variables, the theory should approach a classical limit. This 
classical limit will be very different from the more obvious limit h — > 0. In 
particular, the ground state can be very different. The hamiltonian is 



If we define H = $L , V(r) = ±V(Nip) we get 
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If we define 

we get the commutation relations 

k.P] = -^ (12) 

and the hamiltonian 

H = -h\ 2 + V coU (p) + V(p), (13) 

where V co u = - ^ N ~^i N ~ 3 ^ j^i Now we can take the limit as N — > oo keeping 
V(p), h fixed. The collective potential tends to 

V coll (p) - ^$(p), <&(/>) = ^- (W) 

The quantum fluctuations in p are indeed small because the commutator [w, p] ~ 
■^r. In the limit we get a classical hamiltonian 

H = hV + ^*(p) + V(p) (15) 

with the canonical Poisson bracket relations 

{p,p} = l. (16) 

The ground state of this system is given by p = 0, p = po where po is the 
minimum of 

^Hp) + V(p). (17) 

In the conventional classical limit % — > keeping N fixed, the ground state 
would have been at the minimum of V(p). For example, if V(p) = ^u 2 p 2 , it 
has minimum at p — with vanishing ground state energy. On the other hand, 
the ground state of the large N limit is concentrated at 



Po 



(18) 



.\/2< 

with E gs = 77= . Thus the large N limit better captures the zero point fluctua- 
tions of this system. 

The passage from matrix elements to invariants of a collection of matrices is 
analogous to the above change of variables from cartesian to radial co-ordinates. 
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There is a jacobian associated with this nonlinear change. Transforming to 
'radial wavefunctions' to absorb this will again induce an extra term in the 
potential, involving the derivative of the jacobian. These universal terms enable 
the Large N limit to capture features of the theory that the usual perturbation 
expansion misses. They are crucial in determining the ground state of the matrix 
model. Is it possible to understand them directly in the large N limit? 

In this paper we show that this universal term has a natural meaning in 
the world of non-commutative probability theory: it is the free analogue of the 
Fisher information obtained by Voiculescu |9|, |l0[ [ll], This continues our 
earlier work || where it is shown that the jacobian of the change of variables to 
invariants is just the exponential of free entropy. Information is the square of 
the gradient of entropy, which gives a rough explanation of our present result. 

We first motivate the connection between free probability theory and large 
N matrix models in the context of static models, explaining some concepts from 
non-commutative probability theory that appear not to be widely known in the 
physics literature. Then we carry out this reformulation for the problem of 
matrix quantum mechanics (or field theory at D (dimension of spacetime) = 1). 
This allows us to recover the results of Jevicki-Sakita jl^, [TtJ . Certain 

points of view on non-commutative probability theory generalize beyond D = 1 , 
and we use them to formulate a variational principle for general matrix models. 
In the last part of the paper, we apply this principle to some specific examples 
to get approximate answers for the ground state of matrix field theories with 
D > 1, where explicit solutions are not possible. 

2 Non-Commutative Probability and Static Ma- 
trix Models: 

Recall j|, P that in non-commutative probability theory, the analogue of a prob- 
ability distribution is a rule that gives an expectation value of a polynomial in 
the basic random variables. More precisely, consider the free associative algebra 
7m = C(Xi, • • • , Xm) generated by the hermitian operator valued random vari- 
ables Xi over the complex numbers. We think of the elements of Tj\/ as formal 
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power series]]] f(X) = f n '" ln Xi 1 ...i n — f 1 Xj. The joint probability distribution 
of the Xi in a tracial state is specified by the expectation values Gj =< Xi >, 
which are cyclically symmetric tensors ('moments') satisfying 

G <t> = l, G*j = Gj, G u f r f J > for any polynomial f(X) (19) 

where I is the string with the opposite order. The last condition is the positivity 
of the 'Hankel matrix' Gi.j = Gij. It is used to construct a metric on the space 
of probability distributions. G i,j Gjk = 8% denotes the inverse. 

If there were only one such random matrix X ( i.e., M = 1), it would be 
possible to introduce a probability density function on the real line po{x) such 
that 

G„ = ( PG {x)x n dx. (20) 



Many interesting quantities (such as entropy or information) have simple for- 
mulas in terms of p(x). 

The main complication in passing to multimatrix models with M > 1 is that 
there is no longer any analogue of p(x). We have to think of non-commutative 
probability distributions indirectly in terms of the moments Gi or certain other 
cyclically symmetric tensors S 1 which contain equivalent data. 

The two sets of tensors S 1 and Gi are related by the 'factorized Schwinger- 
Dyson equations': 

S- hU *Gj lIj2 + V \=0, (21) 

where ??} = b\ llH G H G H . 

The motivation behind this definition is that the tensors S 1 can be thought of 
as providing a 'matrix model' for the non-commutative probability distribution: 

n^oo J dAe N tr S( - A ^ 

N tr Sf-A) 

Thus, we can think of J'^ N tr sw as a ( corrLmu tative) probability density 
function for the matrix elements of a set of N x N matrices Ai . The quantity 
tr S(A) is called the 'action' of the matrix model in the physics literature. This 
extends to the realm of random variables the usual idea that non-commutative 
variables can be realized as matrices with complex numbers as entries. In a 

1 We continue the notation in ||s| . An upper case letter such as / denotes a string of indices 
il • ■ -i n - Repeated multi-indices are summed; <f> denotes the empty string. 
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typical problem arising from quantum physics, the action S(A) is some simple 
polynomial and we are required to calculate the moments arising from it: i.e., 
solve the Schwinger-Dyson equations. 

Infinitesimal variations of a non-commutative probability distribution are 
vector fields L\ on the space of probability distributions, with a natural action 
on the moments given by, 

L}Gj = 5?"'G Jl ij a . (23) 

Algebraically, L\ are the derivations of the tensor algebra Tm corresponding to 
the infinitesimal changes of variables [6A]j — 5)Aj. These vector fields form a 
Lie algebra, with the following Lie bracket, 

[i},4]=^LS iIj2 -^Li lJl2 . (24) 

For the case of the one matrix model, when the strings are labeled by integers, 
this algebra reduces to the Virasoro algebra (without the central term). 

The quantity r\\ appearing in the Schwinger-Dyson equation generates the 
first cohomology of the Lie algebra||, |2^]: it can be viewed as the variation 
f]\{G) = L l jx(G) of some function x(G) which, however cannot be expressed 
as a formal power series in the moments. Thus it is possible to regard the 
Schwinger-Dyson equation as the condition for a function 

fl(G) = S I Gi + X {G) (25) 

to be stationary: L l jD, = 0. This variational principle is useful because it allows 
for approximate solution of the SD equations: it is rare that they can be solved 
exactly. 

This quantity x(G) has a natural meaning in non-commutative probability 
theory: up to an additive constant, x(G) is the free entropy in the sense of 
Voiculescu ||]. It is the logarithm of volume of the set of N x N hermitian 
matrices whose moments are Gi in the N — > oo limit pl[ . 

To obtain a more explicit formula for x, we have to introduce a third way 
of describing a non-commutative probability distribution: a transformation (f> : 
Ai i * 4>{A)i — 4>l Aj that maps given distribution to a reference distribution, 
i.e. <f) : G — > F. Under such a change of variable, x transforms as a 1-cocycle, it 
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is incremented by the expectation value of jr? logdet of the Jacobian of <j>. We 
do not follow this direction here, as it is explained in an earlier paper J8|. 

For a single hermitian random variable, there are simple explicit formulas 
for entropy 

\{G) = V dxdy p G (x) p G {y) \og\x - y\ 



dxdy pr{x) p T (y) log f^M] + x (r) (26) 

where pr{x) = pg(</)(x))(/)' (x). 

It is also worth recalling, that for the case of a single random matrix, the 
Schwinger-Dyson equation can be written in a simpler form using p: 

(Hp)(x) = ±-S'(x). (27) 

(This is called the Mehta-Dyson integral equation.) Here (Hp){x) — j dy^^ 
is the Hilbert transform. 

The expression r\\ = Sl^^G^G^ motivates the introduction of the operator 
d [ flO| ], the difference quotient gradient , whose components are 

d j : T M -> T M ® T M ; &Xj = 5^ h X h ® X h . (28) 

The partial difference quotient can be thought of as acting on a product of 
matrices. It breaks the matrix product at each occurrence of Xj, exposing two 
more matrix indices, thus producing an element in the tensor product. For 
M = 1, d<fi(x) = ^^IZy^ ■ ® relates the non-commutative Hilbert transform to 
the variation of entropy. 

The analogue of the difference quotient, when acting on a trace of a product 
of matrices rather than on a product of matrices, leads to the cyclic gradient 5 

S l : T M -» r M - 5*Xj = S^Xj.^ . (29) 

The ordering of multi-indices is explained by the matrix analogy. For a single 
generator, the cyclic gradient reduces to the partial derivative. The cyclic gradi- 
ent of a polynomial is unchanged by cyclic permutations of any of its coefficients: 
Let S(X) = 5 ,J X / . Then 

8 k S(X) = S Ilkl2 X hh = S {kK) X K (30) 
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where = S fe i- fc ™ + S kmkl - k <»- 1 + ■■■ + S k *- kmkl . 

The formalism developed above allows us to get a formula for the Hilbert 
transform of a collection of non-commutative random variables. We first note 
that for a single hermitian variable, the Mehta-Dyson equation Hp(x) = ^S'(x) 
tells us that the Hilbert transform is the derivative of the 1-matrix model action. 
For several hermitian random variables, the Hilbert transform is a vector H l {X). 
The appropriate generalization of the 'derivative' of the action S I Aj, must 
depend only on the cyclic part of the tensors S 1 . The natural candidate is the 
cyclic gradient 



H\X) = ^S(X). (31) 



Using the definition of S l 



H\X) = ±S K ^X K2Kt = ^-S n X! (32) 

Z7T Z7T 

The formula for the Hilbert transform given above involves the action S(X). 
One can look for a formula for H l (X) in terms of the moments of the matrix 
model. Using the factorized SD equations in the form S 1 " 1 — rjjG 1 '' we get 

H\X) = = ^-G h GuG IM X.j. (33) 

Z7T Z7T 

Obtaining the inverse Hankel matrix from the moments is about as hard as 
getting the moments of a static matrix model given its action. This formula 
may be written 

L\ X ^2n(W(X)X I ) (34) 

thereby relating the Hilbert transform to the variation of entropy. Voiculescu 
used this as the defining property of the Hilbert transform M] . 

Another quantity from the world of non-commutative probability theory that 
is going to be important for our understanding of matrix models is the notion 
of free Fisher information. Following Voiculescu, let us first consider Fisher 
information for a single real random variable /: 

*(/) = hm + -*/). (35 ) 

Here g is Gaussian with unit covariance, / and g are statistically independent 
and <S(/) is Boltzmann's entropy of /. If / has pdf p(x), then 

S{f)= - ( p{x)\ogp{x)dx (36) 
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Regarding $ as the zero time derivative of a Gaussian process starting at /, the 
heat equation gives 

$(/) = Jdx p(x) (dJogp(x)\ (37) 

For several real random variables Xi with joint distribution p(x), the Fisher 
information matrix is defined as 

$ ij '= y > dxp(x)<9 i logp(x)<9 J 'logp(x) (38) 

By analogy, the free Fisher information of a single hermitian random variable 
may be defined as Q : 

= hm x(X + Vtg)-xW 
t-»o+ i 

where 5 is the Wigner semi-circular distribution. If X + v / i5' has pdf p(x,t), 
then x(X + \J~tS) = V J p(x, t)p(y, t) log \x — y\dxdy is the free entropy. The 
complex Burger equation 

dG dG , , 

m +G ^-° (40) 

for the Cauchy transform G(z,t) = J £fegjg ; 2 ^ supp(p) plays the role of the 
heat equation for a semi-circular process and leads to 

$(X) = 27r 2 | ((^(x)) 2 p(x) dx= ^fj dx (p(x)f. (41) 

By analogy with the case of several real random variables, we define the free 
Fisher information matrix for several non-commuting random variables as 

& ] (G) = 2tt 2 ( H* {X)W {X) ) (42) 

$ 4J '(G) cannot be expressed solely in terms of the moments. As with the Hilbert 
transform, we can express it in terms of the Gk and either the inverse Hankel 
matrix G /;,/ or the action S 1 Xi whose large N moments are Gk- 

& j {G) = ^G I -WiV 3 i = \G Illl2 ^ j2 G Il G l2 G Jl G, h 

$ y '(G) = ^S U S 3J G U . (43) 

The equivalence of these two follows from the factorized SD equations relating 
S 1 to G K : 

S u = VjG IJ . (44) 
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For a single generator this formula reduces to the familiar one when we use the 
Mehta Dyson equation. 

= ^5Z(m + l)(n + l)S ro+ iS n+ iG Tn+ „ 

m.n 

= 2tt 2 J {{Hp){x)f p(x) dx (45) 

3 Dynamical Matrix Models and Collective Field 
Theory 

Now let us turn to the main object of our interest, i.e. matrix models in the 
Hamiltonian approach. To have a definite example in mind we consider in some 
detail, the quantum mechanics of a discrete number of hermitian matrices. In 
other words, we consider dynamical multi-matrix models whose Hamiltonians 
are of the form 

& =- K 4 mmm +tv[VlAl] (46) 

Here Ai (t) are N x N hermitian matrices, whose time evolution is described by 
the Hamiltonian above. The positive tensor Kij determining the kinetic energy 
is usually just h 2 5ij. The ground state energy is 

^= lim min (V>TO> (47) 

where the wave function ip(A) is invariant under the adjoint action of the unitary 
group; Ai \— ► UAiW Vi and the inner product is with respect to the Lebesgue 
measure on matrix elements. 

The planar large N limit of the above theory is the limit N — > oo holding 
V 1 = N^ 1 ^ 1 ^ 1 and fixed. Moreover, only wavefunctions that are invariant 
under U (N) arc allowed. It will be convenient to make the rescaling Ai — * NAi 
so that 

£ = H = -^4^1 + Jr tr [ylAl] - (48) 

It is known that in this limit, quantum fluctuations in the moments Gj = 
jj tr Aj become small. Hence there ought to be some classical mechanical sys- 
tem that is equivalent to this limiting case of the matrix model. The configura- 
tion space of this classical mechanical system would be the space of probability 



12 



distributions. It is our task now to determine the hamiltonian and Poisson 
brackets of this theory and use them to gain insight into the properties ( such 
as the ground state) of the matrix model. 



3.1 The case of One Degree of Freedom 

Such a reformulation is most transparent in the case where the time evolution 
of a single hermitian matrix is concerned, 

*H*^^ + X>^ B - (49) 



If we rescale A — > NA this becomes 



H i h 2 



5>„ltr^ (50) 



N 2 2N 4 dA£dA b a ^ n N 

where V n = N n ~ 1 V n . We are interested in determining the properties ( such as 
the ground state) of the system in the limit as N — > oo keeping V n and Ti fixed. 

We quote below the result obtained by Jevicki and Sakita (l4[ |l6[ 0, who 
changed variables to the eigenvalue density p(x) of A and showed that in the 
large TV limit, the Hamiltonian may be written as 

= H = K r + Vcou + J dx V(x) p{x) 
where V(x) = ^V n x n . (51) 

n 

The collective potential and 'radial' kinetic energy are 

VcoU = fl 2 ^ J dx p(xf =^ 2 y J {{Hp){x)) 2 P (X) dx 

K r = —h 2 J dx tt' (x) p(x) 7r'(x) 

i $ % 
where7r(a;) = -j^j—-;[Tr{x),p(x)]=-^6(x-x). (52) 

We notice that their collective potential V co a (the term that really makes the 
Hamiltonian describing the large N limit different from the finite N case) is just 
one fourth the free Fisher information of one hermitian random variable. 

This reformulation of the matrix model allows us to take the limit N — > oo 
easily. Note that N only appears in the commutation relations between p and n. 
Indeed, as N — > oo this commutator becomes small: appears where h appears 

2 We use the term radial by analogy with atomic physics 
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in the Heisenberg relations. Thus TV — > oo is a kind of classical limit. The 
limiting theory has as configuration space the space of probability distributions 
p(x) . The canonical conjugate of p{x) is p(x): 

{p(x),p(x')} = 5(x - x 1 ) (53) 

The hamiltonian 

H=^Jdx fi 2 p l2 {x) p(x) + V coU { P ) + j V{x)p{x)dx (54) 

leads to the equations of motion 

dp(x,t) „ 1 , l0 , N n 2 h 2 , s0 

-^f 1 = E--h 2 p' 2 {x)-—p{xf-V{x), 

9p(x, t) _ ^ 2 d[p(x)p'(x)} 

where E is the Lagrange multiplier imposing the constraint J p{x)dx = 1. The 
spectrum (density of eigenvalues) of the matrix A in ground state of the matrix 
quantum mechanics is then determined in this large N aproximation by the 
static solution 

Pi*) = [2 (E - V(x))] 9 (E - V(x)) . (56) 

As noted above, the constant E is determined by the condition J p(x)dx = 1. 

Notice that the collective potential V co ii plays a crucial role in determining 
this answer; without that term there would have been no such static (ground 
state) solution. The elimination of the angular degrees of freedom induces a 
centrifugal potential ( collective potential V co u) leading to a repulsion of eigen- 
values of A. The volume of the set of matrices of a given spectrum vanishes 
when any pair of eigenvalues coincide. The collective potential is the square of 
the derivative of the logarithm of this volume, so is singular when a pair of eigen- 
values coincide. This repulsion of eigenvalues counterbalances the tendency of 
the eigenvalues to accumulate at the minimum of V(x). It is this balance that 
leads to a continuous density of eigenvalues. 

In the more conventional limit U — ► (keeping N fixed) we would get a 
completely different answer: the ground state is concentrated at the minimum 
of V(x). 
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3.2 Several Degrees of Freedom 

Once we see the connection of the collective potential to the free Fisher information, 
commutative probability theory immediately suggests the generalization to multi- 
matrix models. (We will describe the answer first, postponing its derivation to 
the next subsection. )The probability density p(x) no longer makes sense with 
several random variables; instead we must think of the moment tensors Gj 
as characterizing the non-commutative probability distribution. This way of 
thinking leads to 

H = ^Kij^Gun^ + ^K ij $v(G) + V I G I 

** = ^ = 'Mn ^ 

where 6^) = 1 if there is a cyclic permutation of I that equals J and zero 
otherwise. We will now show by an explicit change of variables that this is 
indeed the correct hamiltonian of the large N limit. It is convenient to introduce 
the effective potential 

V eff = -K^ + V I Gi (58) 

Thus, in the planar limit N — > oo keeping V 1 , Kij fixed we do get a classical 
mechanical system whose configuration space is the space of non-commutative 
probability distributions with Poisson brackets 

{G It p J } = 5$ (59) 

and the hamiltonian 

H = ±K ij p iI G IJ p iJ +^K ij &i(G) + V I G I (60) 

determine the equations of motion 
dG K 



dt 



= {G K ,H} = {G K ,K r } 

■K Illl2 Gl 2 I lJ2Jl + Gl 2 I 1 .] 2 J 1 TT Iltl2 



-KS {Jljj2) 



d-K 



K 



dt 



- 2 ZJ WW.) 1 n + q Gk 



(61) 
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An interesting class of static solutions is determined by 

tt k = V \K\ > 1 

= V > 1 (62) 

Thus, to find the ground state we must minimize V e tf. Note that this vari- 
ational principle involves no approximation aside from the large N limit. The 
equations for extremizing V e ff are, L k K {jKij^ + V I Gj) = 0. The chief obsta- 
cle in solving these equations explicitly is the presence of the Fisher information 
term, which does not have any simple expression in terms of the moments. Of 
the two forms of the information matrix in eqn. (|45|), we choose to use the 
one involving the auxiliary variables S^'s in carrying out the minimization of 
the effective potential. Let us note here, that these variables S 1 (G) are to be 
thought of as functions of the moments themselves, i.e. they are the solution 
to the Schwinger-Dyson equations of some (static) matrix model, whose large 
N moments coincide with those of the dynamical matrix model at a particular 
value of time. In terms of the auxiliary variables S 1 (G) the equations to be 
solved are: 



L k K V eff = \K^ J 



2LWj ~ S lI L k K Gjj 



V Ilkl2 Gi 1 Ki 2 = 0. (63) 



3.3 A geometric derivation of the collective Hamiltonian 

We will study the quantum mechanical system whose degrees of freedom are 
described by a set of hermitian matrices (Ai)% and has hamiltonian 

g= "^ 4(w 3) > ^ tf ^ 1 ' (64) 

The eigenvalue problem of H follows from the condition that the quadratic form 

be extremal, subject to the constraint on the norm < ip\ip >= J \ip(A)\ 2 dA = 1. 

The kinetic energy term is just the laplacian with respect to a euclidean 
metric on the space A of hermitean matrices: 

~g{d[MtAH%) = ~9iW = K l3 S a d S c b (66) 
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This metric is invariant under the action of U(N) on the matrix variables: 
Ai — > UAiU^ 1 . Moreover, the potential energy tr V 1 Ai is invariant under 
U(N). We will regard this as a 'gauge transformation', and allow only for 
wavefunctions that are also invariant under U(N): 

tPiUAU^) = ij)(A). (67) 

The problem is to determine the eigenvalues and eigenfunctions of the hamilto- 
nian H over this class of invariant wave functions. Feynman diagram methods 
applied to this problem [[l9f suggest that there is an important simplification in 
the limit N —* oo holding V 1 and K%j fixed: only Feynman diagrams of planar 
topology contribute. This in turn implies that the gauge invariant observables 
such as Gi = tr Aj have fluctutaions of the order : thus matrix models 
reformulated in terms of invariants tend to some classical theory in the large N 
limit. What are the configuration space, Poisson brackets and hamiltonian of 
this classical theory? 

A complete set of invariants for the matrices are given by Gi = tr Aj. For 
a fixed finite N, only a finite subset of the Gj can be independent variables. If we 
allow N to be arbitrary ( may be even infinite) the Gj are independent except for 
cyclic symmetry, the reality conditions and inequalities. Thus the configuration 
space of matrix models is nothing but the space of non-commutative probability 
distributions that are tracial: 

G = 1, G*j = Gj, Guf*f J > for any polynomial f(X) (68) 

In essence our task is to change variables from cartesian co-ordinates Af b to 

'radial co-ordinates' Gj. 

It is easier to change variables in the quadratic form < ip\H\tp > rather 

than the hamiltonian operator H directly: we only have to deal with first order 

rathar than second order derivatives. By the chain rule of differntiation, 

v dr cty _ v dGj dGj dj>* dj> 

Kl3 dA? b dA) a ~ K ^ dAl dM 8G, dGj ' (b9j 



Now, 



dGi 

dd dGj _ x i 1 u 3J .j 1 jj 3 



d[A t ] a b d[A 3 t 



= S^iAj^t 

5 ^H^^ Gl2lM (70) 
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so that 

dtp* dtp dtp* dip 

where 



9l .j = K.jS^Sj^G^M (72) 

The geometrical meaning of this clear: gi-j is the inverse of the metric ten- 
sor induced on the space of non-commutative probability distributions by the 
euclidean metric on the space of matrices. 

To complete the story we need to understand as well the Jacobian of the 
transformation A, — ► G/. This problem was studied extensively in a previ- 
ous paper, where we showed that this Jacobian is e N x ^ where x(G) is the 
free entropy ( in the sense of Voiculescu) of the non-commutative probability 
distribution whose moments are G/. Thus we get 



e N\(G) dG (?3) 



^ N^ Gl N^ Gj +VlG ^_ 
and, of course also, 

< tp\tp >= J i>*ipe N2x(G) dG. (74) 
To take the limit as N — > oo , it is useful to introduce a new 'radial wavefunction' 

* = e^ N2x tP (75) 

so that 

<V#>= J ***rfG, J V I G I ip*ipe N2x( - G *>dG = J V 7 G/***dG. (76) 

In making this change of variables on the expectation value of kinetic energy, 
there will be some extra contributions from the derivatives of the entropy.These 
will add an extra term V co u to the potential. It is our next task to determine 
this term. Using 

dtp x N * x jw i d x { ) 



N 2 dGi N 2 dGi 2dGi 

we get 



dr dtp N 2 x{G) J_^lJ_^,l d X d X , |2 

9I;J N 2 dG T N 2 dGj 6 9I;J N 2 dd N 2 dG. ; 4 5/; J <9Gj dGj 1 1 



1 d X dm 2 

'Vd&WdG] (78) 
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An integration by parts gives 



dip* dj) N 2 x(G) 

9IJ N 2 dd N 2 dGj e 



1 d** 1 a* 
W 2 'dG~ I Yr 2 d~G~ 



lid 
"27V2 d~G~j 



J 



1 dx dx llTr , 



9i;J 



dx 



dGi 



4' 
*| 2 



dGi dGj 



+total divergence. 



(79) 



The first term leads directly to the 'radial' kinetic energy of eqn. (|57|). The last 
two terms don't involve derivatives of \E r and hence are contributions to the po- 
tential arising from the change of variables: they represent a sort of 'centrifugal 
potential' ( or 'collective potential' in the language of Jevicki-Sakita): 



1 dx dx 

Vcoll = o9l;J 



1 1 



8 



8 aJ ' J dGi dGj AN 2 dGj 



9l;J 



dx 



dd 



(80) 



In the large N limit the second term will be small compared to the first. In any 



case, we show that it vanishes. Using the formula for gi-j and 



dx 

dGj, 



G J > L vi, 



<9iJ—) = IJ^^—^ Sj G l2hJ2Jl —) 



AN 2 dGj yi> ' dGi AN 2 



1 v d ( d X \ _ 1 v dtf 
~ AN 2 13 dG u \ " dGjj J ~ AN 2 3 dGu 

since we are always differentiating with respect to a higher moment. Thus only 
the first term contributes to V co u 

v -k r dx ^ 

= ^G^j^G^^lG^^^ 

= \K l3 G^ M n y M (82) 

From our earlier discussion of the Fisher information we see that 

Vcou = \k^. (83) 

Now we can collect the different terms together to get the answer for the 
hamiltonian stated in eqn. (p7|). 
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4 Approximate Large N Ground State and the 
Variational Principle 



Our aim is to understand the ground state of the Hamiltonian 

H = K r + Veff (84) 

where K r = ^K ij ir I ^ I 'G IaIl j aJl Tr J ^ J ' and V eff = + V'Gj. The 

large N ground state is the probability distribution minimizing the effective 
potential V e ff = V co u + V I Gj. In essence, this is the problem of minimizing free 
Fisher information holding the moments in the potential fixed via the Lagrange 
multipliers V 1 . So far we have not made any approximations aside from the 
large N limit. 

Mean Field Theory: The equation for minimization of the effective po- 
tential also defines for us a variational principle. If wc minimize V e ff over a 
sub-manifold of the configuration space which corresponds to the choice of a 
few parameter family of actions for the static matrix model, we will get an up- 
per bound for the ground state energy. But as we well know, explicit solutions 
of the Dyson-Schwinger equations for arbitrary multi-matrix models are not al- 
ways possible. However we may chose as the action S some solvable model, 
which can then act as a variational ansatz. The simplest choice for the action 
is the gaussian, which leads to the wignerian sub-manifold of moments. This 
leads to an approximation method similar to mean field theory (MFT). As a 
first check on our methods, we present below mean-field calculations for some 
dynamical matrix models. 

Since the gaussian action for static matrix models leads to Wigner moments, 

$ y '(G) = ^S U (G)S 3J {G)G U = ^(G- 1 )^ (85) 

on the wignerian sub-manifold. Here (G _1 ) Jj Gjk = S l k . Let us specialize to 
matrix models with a quartic potential V^dj + V % ^ kl Gijki- Then the effective 
potential to be minimized is 

VeffiGij) = Ik^G- 1 ) 1 ^ + V ij Gij + V^iGijGkt + GuGjk). (86) 
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The minimum of the effective potential is given by, 

eff - — K ij {G- l ) ip (G- l y q + V pg 
8 

+V pqkl G k i + V im G l3 + V P]kq G jk + V ipql G u = 0. (87) 



9Gpq 



Notice that this is a cubically non-linear equation for the positive symmetric 
variational matrix Gy. Gy plays the role of a mean field and this is its self 
consistent equation. 

4.1 Mean Field Theory for Dynamical 1 Matrix Model 

To calibrate our approximation method, let us consider the dynamical 1 matrix 
model 

whose exact large N ground state energy is known from the work of Brezin et.al. 
[ p4[ . On the sub- manifold of Wigner semicircular distributions, 

Viff{G 2 ) = ^-+ 1 -G 2 +2gGt (89) 

whose minimum occurs at 

g?G0 = ^[-i + m- 1/3 + m 1/3 L 

where fj, = -1 + 216g 2 + 12gy/-3 + 324 5 2 (90) 
The semicircular estimate is a variational upper bound E gs < Eg C s . 

4.2 Cramer-Rao Inequality 

Somewhat surprisingly, we can also get a lower bound for E gs . For this we 
make use of the (non-commutative) Cramer-Rao (CR) inequality |^2|, ^3], |l2) 
along with the moment inequalities, which are conditions for positivity of the 
Hankel matrix. The CR inequality states that $ y — (G^ 1 ) 13 is a positive matrix. 
It is the statement that among all distributions with fixed 2nd moment, the 
Wigner distribution minimizes information. For one variable, it is the condition 
$ G2 > 1 which is understood as a sort of uncertainty principle in statistics. 
The moment inequalities (MI) for a single variable are 

G2 ^ 0, G4 5> G2, G2G4G6 — G^ -t- G^Gg — G^G^ ^ 0, • • • (9-0 
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We can saturate the MI by choosing G 2 > 0; G 4 = G%\ G 6 = G| ■ • •. The MI 
are saturated by p{x) = 5(x — Xq), G2n = Xq™, not the Wigner moments. 

The CR-MI estimate is a lower bound E^ S R ~ MI < E ga . We see from the fol- 
lowing comparison with Brezin et.al.'s exact (N — > oo) result, that in addition to 
providing bounds on E gs , mean field theory is a reasonably good approximation. 



g 


^gs 


Tpexact 

bg S 




.01 


.503 


.505 


.505 


.1 


.523 


.542 


.543 


.5 


.591 


.651 


.653 


1 


.653 


.740 


.743 


50 


1.794 


2.217 


2.236 


1000 


4.745 


5.915 


5.969 


9 -> oo 


.472 g 1 ' 3 


.590 g 1 / 3 


.595 g 1 ' 3 



4.3 Semi-Circular Ansatz for a Dynamical 2 Matrix Model 

We consider the two matrix model, that is of considerable interest in M-theory 
[Q(for simplicity we look at the non-supersymmetric counterpart of the 'BFSS 
model), defined by, 



H = 

where V(A,B) = 



If 



i:) 2 



tr 



1 



2\dA b a dAl dB b a dB% 



V{A,B) 



'-^-(A 2 +B 2 ) + ^(AB + BA) - ^[A,B] 2 



(92) 



We read off V« = 



\m 2 \c 



\,V 



ABAB 



= -V 



ABBA 



— — \g and consider 



112/'' 2i 

the region g > 0, |c| < m 2 where is positive. The effective potential to be 
minimized is 



1 



1 



1 



V eff = - tr [G ] + -m 2 {G AA + G BB ) + cG AB - -g(G ABAB - G ABBA ) (93) 



(a (5 s 

In terms of the positive Gij = 
sub-manifold of configuration space, 

Veff (a, (3) 



< a) parameterizing the Wignerian 



-I- m 2 a + c/3+ -g{a 2 - (3 2 ) 



(94) 



4(a 2 -f3 2 ) ' ~ ' 1 2 ; 
For the above range of parameters, V e ff is bounded below. Since the information 
and potential scale oppositely, it has a minimum at a non-trivial a > 0. For 
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c = 0, we get /? = and a cubic for a : 4ga 3 + Am 2 a 2 — 1 = 0. For c > the 
actual solution of the algebraic equations can be obtained numerically. 



4.4 MFT for Relativistic A0 4 Matrix Field Theory 

Next we consider a relativistic hermitian scalar matrix field <fr%(x) in d + 1 
dimensions. The Hamiltonian in the Schrodinger representation is 

X2 



• (95) 



Let G xv = %<f>(x)<j>(y), J d d yG xy G yz = 5 d (x - z). Within the mean field ap- 
proximation, the effective potential is 

V eff = ±f dxG xx + J dxdy l -5(x-y){-^+ml)G xv + 2\ {) J dx(G xx f. (96) 

Minimizing it leads to an integral equation for the mean field G xy 

- ^5(x -V) + \J dzG zx (—^ + ml)G zy + 4A J dzG zz G zx G zy = (97) 

Assuming a translation invariant ground state 



dd P „ip{* 



G xy =G(x-y)= I --J-e^-^G(p), (98) 



this equation can be solved: 



(27T) 



2 VP + m 



where m is determined self consistently by 

™ 2 = < + 4A ° f ( £ P v 7 f= 2 ( 10 °) 
J ( 27T r \Jv + m 

We recognize this as the mass renormalization; log, linearly and quadratically 
divergent in 1, 2 and 3 spatial dimensions. We need to go beyond the mean field 
ansatz to see coupling constant renormalization. We will address these issues in 
a separate paper. 
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